Phonological structure in speech recognition
نویسندگان
چکیده
منابع مشابه
Speech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملUsing automatic speech recognition for phonological purposes:
1. INTRODUCTION Automatic speech technology offers great opportunities for investigating a wide range of issues in Laboratory Phonology (e.g. [1, 2]). However, its availability hardly extends beyond a small number of languages, and most spoken languages are thus under-resourced in this perspective. Nevertheless, an increasing effort is done to develop automatic speech recognition (ASR) for such...
متن کاملSpeech synthesis by phonological structure matching
This paper presents a new technique for speech synthesis by unit selection. The technique works by specifying the synthesis target and the speech database as phonological trees, and using a selection algorithm which finds the largest parts of trees in the database which match parts of the target tree. The technique avoids many of the errors made by prosody generation modules by incorporating th...
متن کاملTechniques For Modelling Phonological Processes In Automatic Speech Recognition
Systems which automatically transcribe carefully dictated speech are now commercially available, but their performance degrades dramatically when the speaking style of users becomes more relaxed or conversational. This dissertation focuses on techniques that aim to improve the robustness of statistical speech transcription systems to conversational speaking styles. The dissertation shows first ...
متن کاملThe status of functional phonological information in statistical speech recognition
The choice of speech production models as a basis for Automatic Speech Recognition (ASR) is often taken to have two straightforward implications for the topology of recognition systems. Firstly, it establishes a set of articulatory properties whose elements are the basic linguistic units to be extracted from the signal. Secondly, it predefines an internal classification of these properties whic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Phonology Yearbook
سال: 1986
ISSN: 0265-8062,2059-6286
DOI: 10.1017/s0952675700000622